A Large-Grain Parallel Sparse System Solver
نویسندگان
چکیده
The eeciency of solving sparse linear systems on parallel processors and more complex multicluster architectures such as Cedar is greatly enhanced if relatively large grain computational tasks can be assigned to each cluster or processor. The ordering of a system into a bordered block upper triangular form facilitates a reasonable large-grain partitioning. A new algorithm which produces this form for unsymmetric sparse linear systems is considered and the associated factorization algorithm is presented. Computational results are presented for the Cedar multiprocessor. Several techniques have been proposed to solve large sparse systems of linear equations on parallel processors. A key task which determines the eeectiveness of these techniques is the identiication and exploitation of the computational granularity appropriate for the target multiprocessor architecture. Many algorithms assume special properties such as symmetric positive deeniteness or exploit knowledge of the application from which the system arises e.g., nite element problems. In this paper, we give an overview of the use of a new ordering technique, the hybrid ordering (H*), and an associated factorization algorithm for unsymmetric unstructured sparse linear systems. More detail on the reordering can be found in 7] and on the merger of the reordering and the factorization algorithm for multicluster architectures in 4]. 1. The Hybrid Ordering. The hybrid ordering H* is composed of two diierent types of orderings: unsymmetric and symmetric. The unsymmetric ordering changes the associated graph of the matrix, mostly by row or column interchanges. The symmetric orderings only relabel the nodes of the associated graphs and maintain certain properties of the system, e.g., symmetry, diagonal dominance. The symmetric orderings are used to obtain a bordered block triangular matrix. The unsymmetric ordering is used to enhance the numerical properties of the matrix.
منابع مشابه
Performance of a Fully Parallel Sparse Solver
The performance of a fully parallel direct solver for large sparse symmetric positive deenite systems of linear equations is demonstrated. The solver is designed for distributed-memory, message-passing parallel computer systems. All phases of the computation, including symbolic processing as well as numeric factorization and triangular solution, are performed in parallel. A parallel Cartesian n...
متن کاملUn modèle de programmation à grain fin pour la parallélisation de solveurs linéaires creux. (A fine grain model programming for parallelization of sparse linear solver)
A fine grain model programming for parallelization of sparse linear solver
متن کاملpARMS: A Package for the Parallel Iterative Solution of General Large Sparse Linear System ∗ User’s Guide
For many large-scale applications, solving large sparse linear systems is the most time-consuming part. The important criteria for a suitable solver include efficiency, robustness, and good parallel performance. The Parallel Algebraic Recursive Multilevel Solver (pARMS) [8] is a suite of distributed-memory iterative accelerators and preconditioners targeting the solution of general sparse linea...
متن کاملA Parallel Frontal Solver For Large Scale Process Simulation and Optimization
For the simulation and optimization of large-scale chemical processes, the overall computing time is often dominated by the time needed to solve a large sparse system of linear equations. We present here a new parallel frontal solver which can signi cantly reduce the wallclock time required to solve these linear equation systems using parallel/vector supercomputers. The algorithm exploits both ...
متن کاملA Parallel Frontal Solver For Large Scale Process Simulation
For the simulation and optimization of large-scale chemical processes, the overall computing time is often dominated by the time needed to solve a large sparse system of linear equations. We present here a new parallel frontal solver which can signiicantly reduce the wallclock time required to solve these linear equation systems using parallel/vector supercomputers. The algorithm exploits both ...
متن کامل